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7.2 Calculus of Variations 

One theme of this book is the relation of equations to minimum principles. To 
minimize P is to solve P — 0. There may be more to it, but that is the main 
point. For a quadratic P{u) = \vt} Ku — u T f, there is no difficulty in reaching 
P = Ku — / = 0. The matrix K is symmetric positive definite at a minimum. 

In a continuous problem, the "derivative" of P is not so easy to find. The unknown 
u(x) is a function, and P(u) is usually an integral. Its derivative SP/Su is called the 
first variation. The u Euler- Lagrange equation" 5P/Su = has a weak form 
and a strong form. For an elastic bar, P is the integral of \c{u'(x)) 2 — f(x)u(x). 
The equation SP/Su = is linear and the problem will have boundary conditions: 

Weak form J cu'v' dx = J fv dx for every v Strong form — (cu')' = f(x). 

Our goal in this section is to get beyond this first example of SP/Su. 

The basic idea should be simple and it is: Perturb u(x) by a test function 
v(x). Comparing P(u) with P(u + v), the linear term in the difference yields SP/Su. 
This linear term must be zero for every admissible v (weak form). This program 
carries ordinary calculus into the calculus of variations. We do it in several steps: 

1. One-dimensional problems P(u) = J F(u,u') dx, not necessarily quadratic 

2. Constraints, not necessarily linear, with their Lagrange multipliers 

3. Two-dimensional problems P{u) = J J F(u,u x ,u y ) dx dy 

4. Time-dependent equations in which u' = du/dt. 

At each step the examples will be as familiar (and famous) as possible. In two 
dimensions that means Laplace's equation, and minimal surfaces in the nonlinear 
case. In time-dependent problems it means Newton's Laws, and relativity in the 
nonlinear case. In one dimension we rediscover the straight line and the circle. 

This section is also the opening to control theory — the modern form of the 
calculus of variations. Its constraints are differential equations, and Pontryagin's 
maximum principle yields solutions. That is a whole world of good mathematics. 

Remark To go from the strong form to the weak form, multiply by v and integrate. 
For matrices the strong form is A T CAu = f. The weak form is v T A T CAu = v T f for 
all v. 

For functions with Au = u' , this exactly matches J cu'v' dx = f fvdx above. 
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One-dimensional Problems 

The basic problem is to minimize P(u) with a boundary condition at each end: 

P(u) — I F(u, u)dx with u(0) = a and u(l) = b . 
Jo 

The best u defeats every other candidate u+v that satisfies these boundary conditions. 
Then (u + v)(0) = a and (u + v)(l) = b require that v (0) = v (1) = 0. When v and 
v are small the correction terms come from dF/du and dF/du' . They don't involve 



v 2 : 



dF dF 

Inside the integral F(u + v,u +v ) = F(u, u ) + v— — h v — —. + ■ ■ 

du du 

After integrating P(u + v) = P(u) + / ( v— — h v'-pr— 7 ) dx + ■ ■ ■ 

J V du du' J 

That integrated term is the "first variation". We have already reached 5P/5u: 

SP ( dF OF \ 

Weak form — = / I v— h v — — - ) dx = for every v . 

ou J Q \ du du' J 



(1) 



This is the equation for u. The derivative of P in each direction v must be zero. 
Otherwise we can make 8P/Su negative, which would mean P(u + v) < P(u): bad. 

The strong form looks for a single derivative which — if it is zero — makes all these 
directional derivatives zero. It comes from integrating 5P/Su by parts: 



1 / dF d fdF\\ , r OF n 1 

du'jo 



Weak form / by parts / ( v — v— ( ^— j ) ) dx + 







The boundary term vanishes because v(0) = v(l) = 0. To guarantee zero for every 
v(x) in the integral, the function multiplying v must be zero (strong form): 

dF d fdF\ 

Euler-Lagrange equation for u — — I — — - = . (2) 



du dx V du 



Example Find the shortest path u(x) between two points (0, a) and (1,6). 

By Pythagoras, a/ (dx) 2 + (du) 2 is a short step on the path. So P(u') = J a/1 + (u ! ) 2 dx 
is the length of the path between the points. This square root F(u') depends only on u' 
and dF/du = 0. The derivative dF/du 1 brings the square root into the denominator: 

dF f 1 u' 

Weak form — — = / v — - dx = for every v with v(0) = v(l) = . (3) 

du Jo y/l+ (u 1 ) 2 

If the quantity multiplying v is a constant, then (3) is satisfied. The integral is 
certain to be zero because v(0) = v(l) = 0. The strong form forces dF/du' to be 
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constant: the Euler-Lagrange equation (2) is 



d fdF 



dx \ du 



d 



u 



dx ,/T 



[u 



l\2 



or 



u 



y/l + (u')'< 



c . 



(4) 



That integration is always possible when F depends only on u r (dF/du 
0). It leaves the equation dF/du' = c. Squaring both sides, u is seen to be linear: 



lu 



. >\2 



c 2 (l + (u')2) and u' 



and u 



-.x + d . 



(5) 



replacementT ne constants c and d are chosen to match u(0) = a and u(l) = b. The shortest curve 
connecting two points is a straight line. No surprise! The length P(u) is a minimum, 
not a maximum or a saddle point, because the second derivative F is positive. 



u = optimal arc 




> x 




» x 



typical perturbation v(x) area below arc is A 

Figure 7.5: Shortest paths from a to b: straight line and circular arc (constrained). 



Constrained Problems 



Suppose we cannot go in a straight line because of a constraint. When the constraint 
is J u(x) dx = A, we look for the shortest curve that has area A below it: 

Minimize P(u) = / y/l + (u 1 ) 2 dx with u(0) = a, u(l) = b, / u{x) dx = A. 
Jo Jo 

The area constraint should be built into P by a Lagrange multiplier — here called m. 
The multiplier is a number and not a function, because there is one overall constraint 
rather than a constraint at every point. The LagrangianL builds in J udx = A: 

Lagrangian L(u, m) = P + (multiplier) (constraint) = J(F + mu) dx — mA . 



The Euler-Lagrange equation 8L/5u = is exactly like 5P/5u = in (2): 



d(F + mu) 
du 



d 

dx 



d(F + mu) 
du 1 



m — 



d 

dx 



u 



u 



A 2 



0. 



(6) 
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Again this equation is favorable enough to be integrated: 

u' . , mx 

mx - = c which gives u 



/ l + (n / ) 2 A/l-(mx-c) 2 
After one more integration we reach the equation of a circle in the x-u plane: 

u(x) = — v/l — (mx — c) 2 + d and (mx — c) 2 + (mu — d) 2 = 1 . (7) 
m 

The shortest path is a circular arc\ It goes high enough to enclose area A. 
The three numbers m,c,d are determined by the conditions u(0) = a,u(l) = b, and 
J u dx = A. The arc is drawn in Figure 7.5 (and m is negative). 

We now summarize the one- dimensional case, allowing F to depend also on u . 
That introduces v into the weak form and needs two integrations by parts to reach 
the Euler-Lagrange equation. When F involves a varying coefficient c(x), the form 
of the equation does not change, because it is u and not x that is perturbed. 



The first variation of P(u) = F(u, u f ,u ff , x) dx is zero at a minimu 



m: 



Weak 6P__ f 1 fdF ,8F | ^ dF ; _ Q ^ ^ 

form Su J V du du' du" ' ' 



The Euler-Lagrange equation from integration by parts determines u(x): 

Strong dF_ _ d_ f dF_ \ f dF \ _ 

form 9 M rfa; \du' ) dx 2 \du" J 

Constraints on u bring Lagrange multipliers and saddle points of L. 

Applications are everywhere, and we mention one (of many) in sports. What 
angle is optimal in shooting a basketball? The force of the shot depends on the 
launch angle — line drives or sky hooks need the most push. The force is minimized 
at 45° if the ball leaves your hand ten feet up; for shorter people the angle is about 
50°. What is interesting is that the same angle solves a second optimization problem: 
to have the largest margin of error and still go through the hoop. 

The condition is P = in basketball (one shot) and 5P/5u = in track — where 
the strategy to minimize the time P(u) has been analyzed for every distance. 



Two-dimensional Problems 



In two dimensions the principle is the same. The starting point is a quadratic P(u), 
without constraints, representing the potential energy over a plane region S: 



Minimize P(u) 



du 
dx 



+ 



du 
dy 



f(x,y)u(x,y) 



dx dy . 
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If this energy has its minimum at u(x,y), then P{u + v) > P{u) for every v(x,y). 
We mentally substitute u + v in place of u, and look for the term that is linear in v. 
That term is the first variation 5P/5u, which must be zero for every v(x,y): 




This is the equation of virtual work. It holds for all admissible functions v(x,y), 
and it is the weak form of Euler-Lagrange. The strong form requires as always an 
integration by parts (Green's formula), in which the boundary conditions take care of 
the boundary terms. Inside S, that integration moves derivatives away from v(x,y): 



Integrate by parts 



d_ 

dx 



dx J 



d_ 

dy 



dy) 



v dxdy = . 



(9) 



Now the strong form appears. This integral is zero for every v(x,y). By the "funda- 
mental lemma" of the calculus of variations, the term in brackets is forced to be zero 
everywhere: 

d ( du \ d ( Ou \ 
Strong form — — — I c— — ] — — I c— J = f(x,y) throughout S . (10) 
ox \ ox J oy \ oy J 

This is the Euler-Lagrange equation A T CA = f, or —V • cVm = /. For constant c it 
is Poisson. If the y variable is removed, we are back to a one- dimensional rod. 

With no extra effort we can go backwards to P(u) from any linear equation: 

Q'^xi d 2 u d^u 

Second-order equation a — — - + 2b — — - — h c 7— = . (11) 

ox 1 oxoy oy 1 

When a, b, and c are constant, the corresponding quadratic "energy" is 

dx dy . 



P(u) = \ 



(du\ 2^ (du\ (du\ fdu 
\dx J \dx J \dy J \dy 



If we minimize P we expect to reach (11) as its Euler equation. But there is more 
to it than that. To minimize P it should be positive definite. Inside the integral is 
an ordinary 2 by 2 quadratic au\ + 2bu x u y + cUy. The test for positive-definiteness 
is still ac > b 2 , as it was in Chapter 1. (We can make a > in advance.) That test 
decides whether or not equation (11) can be solved with arbitrary boundary values 
on u(x, y). 

In this positive definite case the equation is called "elliptic" and minimization is 
justified. There are three fundamental classes of partial differential equations: 
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The partial differential equation au xx + 2bu xy + cu yy = is elliptic or parabolic or 

a b 
b c 



hyperbolic, according to the matrix 



E 
P 
H 



ac > b 2 
ac = b 2 
ac < b 2 



elliptic boundary-value problems 
parabolic initial-value problems 
hyperbolic initial-value problems 



(steady state equations) 
(heat/diffusion equations) 
(wave/convection equations) 



Laplace's equation u xx + u yy = is elliptic, with a = c = 1 producing the identity 
matrix. The heat equation u xx — u t = is parabolic, with b = c = 0; the matrix is 
singular and its determinant is zero. That parabolic borderline between elliptic and 
hyperbolic needs help from lower-order terms. The wave equation u xx — u tt = has 
a = 1 and c = — 1. It asks for initial values — two conditions on part of the boundary 
and no conditions on another part, instead of one condition everywhere. 

Here we stay with elliptic equations and minima of P(u). The boundary con- 
ditions can specify u, or they can specify the normal component w ■ n. 

Comparing (8) with (9), the boundary term comes from Green's formula in Sec- 
tion : 



cVu ■ Vv dx dy 



(V ■ (cVu))vdxdy+ / (cVu-n)vds. (12) 



c 



Both sides equal J J fvdxdy; that is the weak form. The first term on the right 
yields the strong form — div(cgradw) = f(x,y). A zero integral over the boundary c 
in (12) will be achieved by the boundary conditions. Strictly speaking, the boundary 
conditions on u(x,y) are part of the strong form. 

There are two ways to make this boundary integral of (cVw • n)v safely zero. If 
boundary values u = uq are given, and u + v shares those values, then v = on the 
boundary. That kills the integral. When u is not given and v is free, we must impose 
the natural boundary condition cVw ■ n = w ■ n = 0. 



A natural condition on w goes with A T . The essential condition u 
with A. 



u Q goes 



The Minimal Surface Problem 

Now we are ready for nonlinear partial differential equations. The corresponding 
energy will not be a quadratic P{u). It will be the exact energy E(u), from which 
P originally came as an approximation. If there is a thin membrane covering the set 
S — like a soap bubble — then stretching this membrane requires energy. The energy 
is proportional to the surface area of the soap bubble. In the right units the 
material constant will be c = 1, and the problem is to minimize the surface 
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area E(u): 



Choose u(x, y) to minimize E(u) 



( du\ 2 ( du x ' 
1+ U- + hr- 



\dx J \dy 



1/2 



dxdy. (13) 



When it = 0, the bubble is flat. The expression in brackets reduces to 1. That is 
the minimum if u = is admissible. But suppose the bubble is created on a piece of 
wire that goes around S at a varying height Uo(x,y). The bubble has to stick to the 
wire, so the trivial solution u = is not allowed. The bent wire imposes a boundary 
condition u = uo{x,y) at the edge of S, and the minimal surface problem is to 
find the smallest area E(u). Physically, surface tension makes the area a minimum. 

The test for a minimum is still E(u) < E(u + v). To compute the term 5E/5u 
that is linear in v, look at the part A from u alone, and the correction B involving v: 

A 1 ( ® u \ 2 f du\ 2 g ^dudv ^dudv 2 

\dx J \dy J dx dx dy dy 

For small v, the square root is \]~A~ + B = \J~A + B/2yA + ■ ■ ■ . Integrate both sides: 

_ . . . j « f f 1 / du dv du dv \ , , . > 

E(u + „) = E(u) + Jj—{-- + --\ dxiv + .... (14) 



Now SE/6u is exposed. It is the integral in (14) and it is zero. That is the weak form 
of the minimal surface equation. Because of the square root of A, it is not linear in 
u. (It is always linear in v; that is the whole point of the first variation!) Integrating 
by parts to remove the derivatives from v produces the Euler equation in its strong 
form: 



Minimal surface d ( 1 du\ d ( 1 du 



equation dx \ y/A dx J dy \ \J~A dy 



0. (15) 



This is not easy to solve, because of the square root in the denominator. For nearly 
flat bubbles, linearization approximates \J~A by 1. The result is Laplace's equation. 
Perhaps it is only natural that the most important nonlinear equation in geometry 
should reduce to the most important linear equation, but still it is beautiful. 



Nonlinear Equations 

The shortest distance and minimal surface problems are typical of the general case. 
The variational problem starts with an integral E — JfFdx dy. Then F depends on 
x and y and u and derivatives like du/dx: 

F = F(x, y, u, Diu, D 2 u, . . .) . 



For an elastic bar there was only D\U = du/dx. For a soap bubble there is also 
D2U = du/dy. Higher derivatives are allowed, and we can think of u itself as Dqu. 
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The comparison of E(u) with the nearby E(u + v) starts from an ordinary expan- 
sion like F(u +v) — F{u) + F'{u)v + 0{v 2 ). When F depends on several derivatives 
of u, this expansion has more terms from F(x, y, Dqu + Dqv, D\u + D\v, . . .): 

Inside the integral F(u + v) — F(u) + ~QB~^ iV (-^) 

We take the derivatives of F with respect to u and u x and any other D\u. 

The weak form involving v deals with those linear terms: (integral= for all v). 
The strong form lifts each derivative Di from v and puts it (as Dj) onto the part 
involving u: 



Weak to Strong / / ( ^ ) (D,r)<l.r dy 



OF 



dD;u 



v dx dy 



The transpose is Dj = —Di for derivatives of odd order (with an odd number of 
integrations by parts and minus signs). For even derivatives Dj = +Di. 

Buried inside the calculus of variations is the real source of A T CA\ The derivatives 
DiU give Au. Their "transposes" Dj give A T . In between, C is normally nonlinear. 
But when F is a pure quadratic ^c(Du) 2 , then D T dF/dDu becomes D T (cDu) — which 
is exactly the linear A T CAu the we know so well. 

7G Each problem in the calculus of variations can be stated in three forms: 
Variational form Minimize E{u) = // F(x,y,u, DiU, D 2 u, . . .) dx dy 



s 



5E f f I \ > dF \ 

Weak form — - = I > 777- — I (Div) dxdy = for all v 
Su J J \^dDiu) y 1 

s 

(dF \ 
— I = 0. 
dDiUj 



Example 1 F = u 2 + u\ + u 2 y + u 2 xx + u 2 xy + u 2 yy = (D u) 2 H h (D 5 u) 2 

The derivatives of F (a pure quadratic) are 2u, 2u x , 2u y , . . . , 2u yy . They are derivatives 
with respect to u and u x and the other DiU, not with respect to x\ 



Weak form 2 J J [uv + u x v x + u y v y + u xx v xx + u xy v xy + u yy v yy ] dxdy = 0. 

We integrate every term by parts to give the strong form (times v): 

Strong form 2[u- u xx - u yy + u xxxx + u xyxy + u yyyy ] = . 

This is linear because F is quadratic; the minus signs come with odd derivatives in F. 
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Example 2 F={l + u 2 x ) 1 / 2 or F = (1 + u 2 + u 2 ) 1 ! 2 

The derivatives with respect to u x and u y bring the square root into the denominator. 
The shortest path equation and the minimal surface equation are 

' ! " and * * f!^=0. 



dx (l + w2)i/2 dx\FJ dy\F. 

Every term fits into the pattern of A T CA, and the framework becomes nonlinear: 

OF 

e = Au — > w = C(e) = — leads to A T w = A T C(Au) = f . 

nonlinear QC 



Nonlinear C{Au) from Nonquadratic Energies 

That last line was worth a chapter of words. It is the shortest path to nonlinear 
equilibrium equations. In a linear spring, w = ce is proportional to e. The internal 
strain energy is F = J cede = \ce 2 . In a nonlinear spring the constitutive 
law is w = C(e). The relation of force to stretching, or current to voltage, or flow 
to pressure, is no longer a straight line Ce. We need parentheses in C(e)\ 

The energy density is still F — JC(e)de = force times movement. Minimizing 
the total energy (integrate F to find E) still gives the equilibrium equation: 

The energy P(u) = J [F(Au) — fu]dx is minimized when A T C(Au) = f . 

The first variation of E leads to J[C(Au)(Av ) — fv] dx = for every v (weak form). 
Then A T C(Au) = f is the Euler equation (strong form). 

For the nonlinear equivalent of positive definiteness, the function C(e) should be 
increasing. The line w = ce had a constant slope c > 0. Now that slope C = dC/de 
is changing — but it is still positive. That makes the energy E(u) a convex function. 
The Euler equation is elliptic — we have a minimum. 

An example is the power law w = C(e) = e p_1 with p > 1. The energy density is 
its integral F = e p /p. The stretching is e = Au = du/dx. The equilibrium equation is 
A T C(Au) = (—d/dx)(du/dx) p ~ 1 = f. This is linear for p = 2. Otherwise nonlinear. 



Complementary Energy 

The complementary energy is a function of w instead of e. It starts with the inverse 
constitutive law e = C~ 1 (w) — in our example e = w 1 ^^ 1 ^. The strain e comes from 
the stress w; the arrow in our framework is reversed. Graphically, we are looking at 
Figure 7.6a from the side. The area under that curve is the complementary 
energy density F*(w) = J C^ x {w) dw. The twin equations come from F and F*: 

OF _ OF* 

Constitutive Laws w = C(e) = — — and e = C _1 (w) = — — . (17) 
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The symmetry is perfect and the dual minimum principle applies to Q(w) = J F*(w) dx: 

The complementary energy Q(w) is a minimum subject to A T w = f(x). 

A Lagrange multiplier u(x) takes Q to L(w, u) = J[F (w) — uA T w + uf] dx, with the 
constraint A T w = f built in. Its derivatives recover the two equations of equilibrium, 
now nonlinear: 

dL/dw = is C- 1 (w)-Au = 
OL/du = is A T w =f. 

The first equation gives w = C(Au) and then the second is A T C(Au) = f. 




Since these nonlinear things are in front of us, why not take the last step? It is 
never seen in advanced calculus, but there is nothing so incredibly difficult. It is the 
direct link between F and F , known as the Legendre-Fenchel transform: 

F (w) = max [ew — F(e)] and F(e) = max [ew — F (w)] . (18) 

For the first maximum, differentiate with respect to e. That brings back w = dF/de, 
which is the correct C(e). The maximum itself is F* = edF/de — F. The figures 
show graphically that the areas satisfy F* = ew — F on the curve and ew — F < F 
off the curve. So the maximum of ew — F is F* as desired. 

Similarly, the second maximum in (18) leads to e = dF* jdw. That is the con- 
stitutive law in the other direction, e = C~ l {w). The whole nonlinear theory is 
there, provided the material laws are conservative — the energy in the system should 
be constant. This conservation law seems to be destroyed by dissipation, or more 
spectacularly by fission, but in some ultimate picture of the universe it must remain 
true. 

F will be the Lagrangian and F* is the Hamiltonian. The equations 
mu tt + cu t + ku = and u xx = u t include friction and damping and diffusion, but 
we hesitate to stretch our framework that far. Feynman's wonderful lectures made 
"least action" the starting point for physics, and the next paragraphs. 
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The Legendre transform reappears at full strength in constrained optimization. 
There F and F* are more general convex functions (with nonnegative second deriva- 
tives) and we recognize that F** is F. Here we compute F*(w) for the power law 
and verify that it agrees with the integral of C~ 1 (w). 

Example Find F*(w) for the power law F(e) = e p /p (e > and w > and p > 1) 

Differentiating ew — F(e) gives w — e p ~ l = 0. Then the conjugate F*(w) in (18) is 
w q /q: 

r = eiu = w /yp >w w p/KP ' = w Pnp ' = -w q . 

p p p q 

F*(w) = w q /q is also a power law, with dual exponent q = p/(p — 1). This matches 
the area under C^ 1 (w) = w l ^~ l \ because integration will increase that exponent to 
1 + l/(p — 1) = q. The symmetric relation between the powers is p' 1 + q^ 1 = 1. 



Dynamics and Least Action 

Fortunately or unfortunately, the world is not in equilibrium. The energy stored in 
springs and beams and nuclei and people is waiting to be released. When the external 
forces change, the equilibrium is destroyed. Potential energy is converted to kinetic 
energy, the system becomes dynamic, and it may or may not find a new steady state. 

When the system is conservative, the transients will not grow or decay. The 
energy changes from potential to kinetic to potential to kinetic, but the total energy 
K + P remains constant. It is like the earth around the sun or a child on a frictionless 
swing. The force dP/du is no longer zero, and the system oscillates. The problems 
are dynamic instead of static. 

To describe the motion we need an equation or a variational principle. Numerically 
we mostly work with equations (Newton's laws and conservation laws). This section 
derives those laws from the principle of least action: 

The actual path u(t) minimizes the action integral A(u) between the initial state 
u(to) and the final state u{t\): 

f h 

A(u) = / (kinetic energy — potential energy) dt . 

Jt 

It is better to claim only that 5A/5u = — the path is always a stationary point but 
not in every circumstance a minimum. We have a difference of energies, and positive 
definiteness can be lost (a saddle point). Laplace's equation will be overtaken by the 
wave equation. First come three examples to show how the global law of least action 
(the variational principle of least action) produces Newton's local law F = ma. 

Example 3 A ball of mass m is attracted by the Earth's gravity 
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The only degree of freedom is the ball's height u(t). The energies are K and P: 



1 / du\ 2 



K = kinetic energy = —m — and P = potential energy = mgu . 

2 \ at J 

The action is A = J(|m(u') 2 — mgu)dt. Then 5A/5u follows from the rules of this 
section — with the time variable t replacing the space variable x. The true path u is 
compared to its neighbors u + v. The linear part of A(u + v) — A{u) gives the first 
variation 8A/5u = 0: 



5A 
Su 



(mu v — mgv) dt = for every v . 



to 



That is the weak form of Newton's law. You recognize the momentum mu' as the 
derivative of \m{u') 2 with respect to the velocity u 1 . The strong form is the Euler 
equation: 

d ( du\ i ■ i ■ 

Newton s law — — I m— — mg = which is ma = F . (19) 



dt \ dt J 

The action integral is minimized by the path that follows Newton's law. 

The 3-step framework of applied mathematics is not changed. The place of A is 
taken by d/dt and A T is —d/dt. The material constant is m and the external force is 
/ = mg. The balance between A T CAu and / is Newton's law — a balance of inertial 
forces instead of mechanical forces. Figure 7.7 identifies the variables as the velocity and 
momentum. 




Figure 7.7: Newton's law for a ball. A pendulum also fits the framework. 



Example 4 A simple pendulum with mass m and length / 

The state variable u is the angle 9 from the vertical. The height / — I cos 9 still enters the 
potential energy and the velocity is v = ld9/dt: 



K = kinetic energy = 




and P = potential energy = mg(l — lcos9) 
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In this problem the equation will no longer be linear. K is quadratic but P involves cos 6*. 
The Euler equation follows the rule for an integral f F(9, 6') dt, with F = K — P: 

Euler dF d ( dF\ . d ( 2 dB\ n 

-— — - — =0 or —mglsmV [ml — = 0. 

equation d0 dt\dO' J dt \ dt J 

This is the equation of a simple pendulum. The mass cancels out; clocks keep time! 

d?0 g 

Pendulum equation — - + — sin 6 = 0. (20) 

dt 2 I 

When the angle is small and sin# is approximated by 6, the equation becomes linear. The 
period changes a little. A linear clock keeps time, but not the right time. 



Example 5 A vertical line of springs and masses has Mu" + Ku = 

The potential energy in the springs is P = ^u T A T CAu. The kinetic energy has ~rrii(dui/ 'dt)' 
from each mass. Then K — P goes into the action integral, and there is an Euler equa- 
tion 5A/5ui = for each mass. In matrix form this is the basic equation of mechanical 
engineering: 

Undamped oscillation Mu " + A T CAu = . 

M is the diagonal mass matrix and A T CA is the positive definite stiffness matrix. The 
system oscillates around equilibrium and the total energy K + P is constant. 

Example 6 Waves in an elastic bar — a continuum of masses and springs. 
The action integral in this continuous case has an integral instead of a sum: 



t 1 f 1 1 fdu\ 1 fdu\ 

Action A(u) = / -m — c — 

1 ; J t0 J x=0 2 \dt) 2 \dx) 



dx dt . 



The Euler-Lagrange rules for 5A/5u = cover this case of a double integral: 

5A d ( du\ d ( du\ 
Wave equation — = -— m— - — -c— = . 

bu ot \ at J ox \ ox J 

That is the wave equation mu tt = cu xx . With constant density m and elastic constant 
c, the wave speed is ^Jc/m — faster when the bar is stiffer and lighter. 

Staying with the calculus of variations, there are two important comments: 



1. When u is independent of time (no motion), the kinetic energy is K = and 
the least action principle reduces to SP/Su = 0. The dynamic problem goes 
back to the static problem of A T CAu = f, without a time derivative. 
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2. The dual variable w, from all the evidence, should be found in its usual place 
in Figure 7.7. It is the momentum p = mv = mdu/dt. We can rewrite K: 

The kinetic energy K = -mv 2 becomes p 2 . 

2 2m 

That is the "complementary" kinetic energy, expressed in terms of p instead of v. 
Note that m moves to the denominator, just as c did in the elastic energy w 2 /2c. 
Hamilton found the right total energy, using the momentum p and the displacement 
u. Those variables (not the velocity!) are at the primal and dual corners of our 
framework: 

Hamiltonian H = K + P = p 2 + mgu . (21) 

2m 

The Hamiltonian H(p, u) takes us directly to the equations of motion: 



TT . dH du , dH 
Hamilton s equations — — = — and — — = - 

op dt ou 


dp 
"dt ' 


(22) 


The derivative dH/dt is zero so the total energy is constant: 






. , dH dH dp dH du , , . , 
Cham rule — - — = — — H — — = u p — p u 

dt op dt du dt 


= 0. 


(23) 



This is the essence of classical mechanics. It is tied to Hamilton and not to Newton. 
For that reason it survived the revolution brought by Einstein. We will see that H 
has a relativistic form and even a quantum mechanical form. Comparing a falling 
ball with an oscillating spring, the key difference in H is between u and u 2 : 

(ball) H = p 2 + mgu (spring) H = p 2 H — cu 2 

2m ' 2m 2 

Hamilton's equations dH/dp = u' and dH/du = —p' yield Newton's Laws: 

p 

(ball) — — u and mg = —p, or mu" = —mg 

m 

p 

(spring) — — u and cu = —p , or mu" + cu = . 

m 

The mass passes through equilibrium at top speed (all energy in K). The force 
reverses to slow it down and stop it (all energy in P). In the u — p plane (the phase 
plane) the motion stays on the energy surface H = constant, which is the ellipse in 
Fig. 7.8. Each oscillation of the spring is a trip around the ellipse. 

With more springs there are 2n axes ui, . . . , u n ,px, . . . ,p n and the ellipse becomes 
an ellipsoid. Hamilton's equations are dH/dpi = dui/dt and dH/dui = —dpi/dt. 
They lead again to Mu" + Ku = 0, and to the wave equation in the continuous case. 
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Figure 7.8: 1 tension 2 motion 3 compression 4 motion: Constant energy H = K + P. 



Relativity and Quantum Mechanics 

These few paragraphs are an attempt, by a total amateur, to correct the action inte- 
gral by the rules of relativity. Feynman's lectures propose the term — mc 2 -y/l — (y/c) 2 
in the Lagrangian K — P. At v — we see Einstein's formula e = mc 2 for the energy 
in a mass m at rest. It becomes part of the potential energy P. 

As the velocity increases from zero there is also a part corresponding to K. For 
small x the square root of 1 — x is approximately 1 — ^x, which linearizes the problem 
and brings back Newton's |mt> 2 — just as linearizing the minimal surface equation 
brought back Laplace. Relativity mixes together K and P\ 

Lagrangian F{ v ) — —rnc 2 \J\ — (v/c) 2 ~ —mc 2 ( 1 — — — j . 

Now, trusting in duality, we look for the conjugate function F* in the Hamiltonian. 
It will be a function of p, not v. The first step is to find that momentum p. Before 
relativity, F was \mv 2 and its derivative was p = mv. Always p is dF/dv: 

OF mv . . 

Momentum p = —— = — = . (24) 

This becomes infinite as v approaches the speed of light. According to (18), F* is 



Hamiltonian F*(p) = max [pv — F(v)] = mc 2 ^/l + (p/mc) 2 . (25) 

This maximum occurs at (24) — which we solved for v and substituted into pv — F(v). 

The new Hamiltonian F* is again Einstein's energy e = mc 2 when the system is 
at rest. As p increases from zero, the next contribution is Newton's p 2 /2m: 

F* w mc 2 I 1 H — ) = rest energy H . 

V 2 m z c 2 J 2m 
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Newton found the low-order term in the energy F* that Einstein computed exactly! 
Perhaps the universe is like a minimal surface in space-time. To Laplace and Newton 
it looked flat (linearized), but to Einstein it became curved. 

It is risky to add anything about quantum mechanics, which works with proba- 
bilities. It is a mixture of differential equations (Schrodinger) and matrix equations 
(Heisenberg). The event at which 5A/5u = almost always occurs. Feynman gave 
each possible trajectory of the system a phase factor e tA ^ h multiplying its probability 
amplitude. The small number h (Planck's constant) means that a slight change in the 
action A completely alters the phase. There are strong canceling effects from nearby 
paths unless the phase is stationary. In other words SA/5u = at the most probable 
path. 

This prediction of "stationary phase" applies equally to light rays and particles. 
Optics follows the same principles as mechanics, and light travels by the fastest route: 
least action becomes least time. If Planck's constant could go to zero, the 
deterministic principles of least action and least time would appear and the path 
of least action would be not only probable but certain. 



Problem Set 7.2 

1 What are the weak form and the strong form of the linear beam equation — the 
Euler equation for P = f[^c(u") 2 — fu] dxl 

2 Minimizing P = J (u') 2 dx with u(0) = a and u(l) = b also leads to the straight 
line through these points. Write down the weak form and the strong form. 

3 Find the Euler equations (strong form) for 
[{u') 2 + e u )dx (b) [uu'dx (c) [ x 2 (u') 2 dx 



4 If F(u, u ) is independent of x, as in almost all our examples, show from the 
Euler equation and the chain rule that H = u'dF/du' — F is constant. This is 
dual to the fact that OF/ du ' is constant when F is independent of u. 

5 If the speed is x the travel time of a light ray is 



f l 1 

T= -^/l + {u') 2 dx with u(0) = and w(l) = 1 . 
Jo x 

(a) From equation 5T/5u = what quantity is constant (Snell's law)? 

(b) Can you integrate once more to find the optimal path u(x)7 

6 With the constraints u(0) = u(l) = and j u dx = A, show that the minimum 
value of P = j(u') 2 dx is VIA 2 . Introduce a multiplier m, solve the Euler 
equation for u, and verify that A = — m/24. Then the derivative dP/dA = 24A 
equals the multiplier — m as the sensitivity theory predicts. 



7.2. CALCULUS OF VARIATIONS ©2006 Gilbert Strang 

7 For the shortest path constrained by f udx = A, what is unusual about the 
solution in Fig. 7.5 as A becomes large? 

8 Suppose the constraint is J udx > A, with inequality allowed. Why does the 
solution remain a straight line as A becomes small? Where does the multiplier 
m remain? This is typical of inequality constraints: either the Euler equation 
is satisfied or the multiplier is zero. 

9 Suppose the constrained problem is reversed, and we maximize the area P = 
J udx subject to fixed length I = J a/1 + (u') 2 dx, with u(0) = a and u(l) = b. 

(a) Form the Lagrangian and solve its Euler equation for u. 

(b) How is the multiplier M related to m in the text? 

(c) When do the constraints eliminate all functions ul 

10 Find by calculus the shortest broken-line path between (0, 1) and (1, 1) that 
goes first to the horizontal axis y = and bounces back. Show that the best 
path treats this axis like a mirror: angle of incidence = angle of reflection. 

11 The principle of maximum entropy selects the probability distribution that max- 
imizes H = — J ulogudx. Introduce Lagrange multipliers for the constraints 
J udx = 1 and J xudx = 1/a, and find by differentiation an equation for u. 
On the interval < x < oo show that the most likely distribution is u = ae~ ax . 

12 If the second moment J x 2 udx is also known show that Gauss wins again: the 
maximizing u is the exponential of a quadratic. If only j udx = 1 is known, 
the most likely distribution is u = constant. The least information comes when 
only one outcome is possible, say u(6) = 1, since u logu is then identically zero. 

13 A helix climbs around a cylinder with x = cos 8,y = sin 6,z = u{9): 

its length is L = J V 'dx 2 + dy 2 + dz 2 = J y/l + {u') 2 d9 . 

Show that u = constant satisfies Euler's equation. The helix is regular. 

14 Multiply the nonlinear equation — u " + sin u = by v and integrate the first 
term by parts to find the weak form. What integral P is minimized by ul 

15 Find the Euler equations (strong form) for 



(a) P{u) = \ 



® 2u \ i{ ^ u V (d 2 u 
dx 2 J \dxdy J \dy 2 



dx dy 

(b) P{u) = \JJ (yul + u\) dxdy (c) E{u) = J uy/l + {u') 2 dx 
(d) P(u) = - [j (ul + u 2 ) dxdy with f f u 2 dxdy = 1. 
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16 Show that the Euler equations for these integrals are the same: 

[f7ni7ni dxd y and ff (inr) dxd y 

J J ox 2 ay 1 J J \oxoy J 

Presumably the two integrals are equal if the boundary conditions are zero. 

17 Sketch the graph of p 2 /2m + mgu = constant in the u — p plane. Is it an ellipse, 
parabola, or hyperbola? Mark the point where the ball reaches maximum height 
and begins to fall. 

18 Draw a second spring and mass hanging from the first. If the masses are m 1? m 2 
and the spring constants are Ci, c 2 , the energy is 

H = K + p = i p? + i rf + \ ClU ' + \ c ^- - 1,1)2 • 

Find the four Hamilton's equations dH/dpi = dui/dt,dH/dui = —dpi/dt, and 
the matrix equation Mu" + Ku = 0. 

19 The Hamiltonian for a pendulum (with u = 9) is H = p 2 /2m + mgl(l — cosu). 
Write out Hamilton's equations (22) and eliminate p to find the equation of a 
pendulum. 

20 Verify that the energy |e T Ce and the complementary energy |i/7 T C _1 w are 
conjugate. As in equation (18), this means that \w T C~ l w = max[e T w- |e T Ce]. 



